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Abstract 


The ability to adequately model risks is crucial for insurance compa¬ 
nies. The method of "Copula-based hierarchical risk aggregafion" (Ar- 
benz ef al. [1]) offers a flexible way in doing so and has affracfed much 
attention recenfly. We briefly infroduce fhe aggregafion free model as 
well as fhe sampling algorifhm proposed by fhey aufhors. 

An imporfanf characferisfic of fhe model is fhaf fhe joinf disfribufion of 
all risk is nof fully specified unless an additional assumption (known 
as "conditional independence assumpfion") is added. We show fhaf 
fhere is numerical evidence fhaf fhe sampling algorifhm yields an ap¬ 
proximation of fhe disfribufion uniquely specified by fhe condifional 
independence assumpfion. We propose a modified algorifhm and pro¬ 
vide a proof fhaf under cerfain condifions fhe said disfribufion is in¬ 
deed approximafed by our algorifhm. 

We furfher defermine fhe space of feasible disfribufions for a given 
aggregafion free model in case we drop fhe condifional independence 
assumpfion. We sfudy fhe impacf of fhe inpuf paramefers and fhe free 
sfrucfure, which allows conclusions of fhe way fhe aggregafion free 
should be designed. 
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Chapter 1 


Introduction 


Why modelling risk? From the perspective of an insurer or reinsurer the an¬ 
swer to this question is straightforward: Taking risks is their core business, 
and hence both their profitability and solvency critically depends on the abil¬ 
ity to adequately model these risks. Besides that, a proper risk management 
is also required by regulatory frameworks such as Solvency II and Basel III. 

Mathematically, risks can be interpreted as a multivariate random variable 
X = (Xi,..., X„), where the univariate random variables Xi,..., X„ repre¬ 
sent the individual risks (or marginal risks). Given that the individual risks 
often share common environmental and socioeconomic conditions, they are 
generally dependent. Therefore, most of the time knowledge of the joint 
distribution is required to be able to properly measure and allocate risk. 

Assume, for instance, an insurance company is interested in the total amount 
of claim payments in a given period in the future, i.e. the quantity X 0 := 
Xi + ... + Xn- In order to compute this quantity, it does not suffice to merely 
know the distributions of the individual risks. Presumably the most obvious 
way to determine the distribution of a sum of dependent risk is to first deter¬ 
mine the joint distribution function F(xi,..., x„) = P[Xi < Xi,..., X„ < x„] 
of the individual risks. Modelling this distribution accurately is a very chal¬ 
lenging task. Although the individual risks constituting the portfolio might 
easily be described with an appropriate stochastic model derived from data 
and/or expert opinion, it is often the case that very few joint observations 
are available, in which case the joint distribution of the individual risks is 
basically unknown [2]. 


Recently, copulas have become the privileged tool to overcome this diffi¬ 
culty. Readers who are not familiar with the theory of copulas will find 
a brief introduction to it in Section 2.1. For now, it is enough to think of 


1 



1. Introduction 


a copula as a multivariate random variable that describes the dependence 
structure between the individual risks. 

A well-known result in copula theory (see Theorem 2.2) states that the dis¬ 
tribution function F can then be written as 

F(xi,...,Xrf) = C(Fi(xi),...,Frf(xd)), 

where F,(x) = F[X, < x], i = are the marginal distribution func¬ 

tions and C : [0,1]” —> [0,1] is a copula function. In this way, we have 
separated the dependence structure from the margins, and the above prob¬ 
lem is reduced to find accurate models for the margins and the 

dependence structure described through the copula C. When it comes to 
choosing the right copula, we can rely on the broad set of different copula 
models that have been developed and studied in the past years. In particular, 
there exist asymmetric copulas and copulas with tail-dependence which try 
to reflect the effects that can be observed in practice. 

Nevertheless, the common parametric copula models are often problematic 
when used in high dimensions because the attainable dependence structures 
are limited. For instance, they often are too symmetric. 

A very elegant method to overcome the limitations arising in high dimen¬ 
sions is commonly referred to as "copula-based hierarchical risk aggrega¬ 
tion". The method has been used in the industry for more than a decade. 
Consider the following simple example where we present its general idea: 

Example 1.1 Assume we are given four different risks represented by the 4-dim. 
random variable X = Here Xi i stands for "Car insurance 

Switzerland", and Xi ^2 stands for "Car insurance Italy". Also, X 24 stands for 
"Earthquake Switzerland", and X 2,2 stands for "Earthquake Italy". In case we are 
interested in the aggregated risk X 0 := X 14 -|- Xi ^2 + X 24 -|- X 2 , 2 , we could try 
to find a model for the joint distribution function F of X by first modelling the 
marginal distributions Fi^, Fi^ 2 / ^ 2,1 emd F 2,2 of the individual risk and impose a 
4-dim. copula C between the individual risk. The joint distribution would then be 
given by F{xi,X 2 ,X 3 ,X 4 ) = C{Fi^i{xi),Fi^ 2 {x 2 ),F 2 ,i{x 3 ),F 2 , 2 {xi)), and the distri¬ 
bution of X 0 can directly be computed from F. 

Alternatively, we could in a first step model the distributions of the partial risks, 
(X 14 , Xi^ 2 ) and (X 24 , X 2 , 2 ), by combining the risks X 14 and Xi ^2 through a bivari¬ 
ate copula Cl, whereas X 24 and X 2,2 are combined through a bivariate copula C 2 . 
Knowing the distributions of the partial risks, we can then easily compute the dis¬ 
tribution of the partial sums Xi := Xi 1 -|- Xi ^2 and X 2 := X 24 -|- X 2 , 2 - The partial 
sums, Xi and X 2 , can then again be combined through an adequate bivariate copula 
model C 0 . Finally, this then allows us to compute the distribution F 0 of the total 
agg^^gate X0 := Xi -|- X2 = Xi 1 -|- X12 -|- X24 -|- X 2 ^ 2 - 

This procedure is known as "copula-based hierarchical risk aggregation" and is best 
illustrated by a so called "aggregation tree". The aggregation tree graphically rep- 
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Figure 1.1: An illustration of the 4-dim. aggregation tree model described in our lead Example 
1 . 1 . 

resents in which way the individual risks are tied together and which dependence 
structure is assumed to hold between them. Figure 1.1 shows the aggregations tree 
corresponding to the situation described above. 

The main advantage of hierarchical risk aggregation is that we do not need 
to specify the copula of all risks. We once again emphasize that it is ex¬ 
tremely unlikely to find a copula model which adequately describes the 
dependence structure between a large number of risks. Joint observations 
between all risks are too rare, and the attainable dependence structures of 
common parametric copula models are too limited. 

Instead, we aggregate the risks hierarchically, which requires only to specify 
the joint dependence between the aggregated sub-portfolios in the different 
aggregation steps. For these sub-portfolios we can use low dimensional cop¬ 
ulas, which are well-known to be more realistic measures of dependence 
and offer greater flexibility. In particular, this model allows to use differ¬ 
ent dependence characteristics (tail dependence, radial asymmetry, etc.) for 
each aggregation step. 

The model can be seen as a dimension reduction tool: complexity and the 
number of parameters can be adjusted to the necessary intricacy and avail¬ 
able information. As a logical consequence, the method does not specify the 
full joint distribution of the individual risks. In other words, there is in 
general more than one distribution which satisfies a given aggregation tree 
model. We will call those distributions "mildly tree dependent". 

In recent years, a number of papers have been published on this subject. Ar- 
benz et al. [1] were the first to provide a sound mathematical foundation for 
the copula-based hierarchical risk aggregation approach. They described the 
structure of the model in graph-theoretical terms and identified a condition 
under which it leads to a unique multivariate distribution. This condition 
is known as "conditional independence assumption" and the unique mul¬ 
tivariate distribution specified by it is called "tree dependent distribution". 
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They also provided a sampling algorithm together with a proof that the sam¬ 
ples generated by the algorithm approximate the aggregated sub-portfolios 
in the model (e.g., Xi, X 2 and X 0 in our lead Example 1.1). 

The papers of Cote & Genest [4] and Bruneton [2] focus on the structure of 
the aggregation tree. In fact a lot of freedom is simply hidden in the order 
in which the risks are aggregated together and in the general shape of the 
tree. Bruneton [2] argues that the diversification effect is larger for thin trees 
than for fat trees. Cote & Genest [4] suggest a procedure for selecting the 
tree structure, based on hierarchical clustering techniques. 

The method of hierarchical risk aggregation is particularly suitable if the 
ultimate goal is to find an approximation for the total aggregate of all indi¬ 
vidual risks. In this case, the fact that the proposed strategy does not lead 
to a unique joint distribution is not critical. 

In practice, however, we may as well be confronted with situations where 
the joint distribution of the individual risks is required. A popular example 
of such would, for instance, be risk allocation or - something more related 
to Example 1.1 - the situation where we want to estimate the total risk we 
are taking in Switzerland, i.e. the distribution of Xi 2 -|- X 24 . Unfortunately, 
the aggregation tree model in Eigure 1.1 does not uniquely specify this dis¬ 
tribution. 

The majority of the results and insights provided by the above mentioned 
papers cover the situation where the total aggregate X 0 is of particular in¬ 
terest. Concerning the joint distribution, there are a couple of interesting 
questions that have not yet been sufficiently addressed. This thesis aims to 
contribute to a better mathematical understanding of the joint distributions 
associated with an aggregation tree model. 

The subject of the first question is the sampling algorithm proposed by Ar- 
benz et al. [1]. As mentioned before, the samples can be used to approximate 
the distribution of the total aggregate X 0 of the individual risks. However, it 
is uncertain so far if the samples are also an approximation of a mildly tree 
dependent distribution. We will conduct a numerical experiment that sug¬ 
gests that the samples approximate the unique tree dependent distribution 
specified by the conditional independence assumption. This will encourage 
us to develop a modified sampling algorithm (MRA). Under the additional 
constraint of discrete marginals the MRA yields an approximation of the 
unique tree dependent distribution. 

It is, however, questionable if the conditional independence assumption is 
indeed a reasonable assumption in practice. If it is not, then determining the 
joint distribution approximated by a sampling algorithm is pointless. Even 
if we can determine it, it will hardly be an adequate representation of the 
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true joint distribution of the individual risks. 

Instead, it would be more advisable to gain awareness of the space of mildly 
tree dependent distributions and how it is affected by the different param¬ 
eters of the aggregation tree model and the tree structure. An interesting 
question, for instance, would be whether it is possible to narrow down 
the number of mildly tree dependent distributions by altering the order 
in which the risks are aggregated. We will address such questions in the 
second part of the thesis, where we study and determine the space of mildly 
tree dependent distributions for simple aggregation tree models. We believe 
that any insurer or reinsurer using such tools should be aware of the insights 
gained from this and that this awareness should strongly call for designing 
trees that adequately fit the business. 

Organisation of the thesis. Chapter 2 introduces the aggregation tree model 
with all the basic definitions and results. The sampling algorithm proposed 
by Arbenz et al. [1] and its convergence properties are described in Chapter 
3. In Chapter 4 we propose the modified sampling algorithm (MRA) and 
discuss its convergence. The space of mildly tree dependent distributions is 
studied in Chapter 5. Chapter 6 concludes the thesis. 
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Chapter 2 


Hierarchical risk aggregation 


This chapter introduces the basic definitions and concepts that play an im¬ 
portant role throughout this thesis. We give a short introduction to copulas 
in the first Section 2.1. In Section 2.2 we describe step-by-step the hierarchi¬ 
cal risk aggregation method. 

We deliberately choose to follow very closely the structure and terminology 
proposed by Arbenz et al. [1]. Readers who are fully familiar with their 
paper may as well skip this chapter. 


2.1 Copulas 

Copulas have become increasingly popular in the last decades both among 
academics and practitioners. They turned out to be a powerful and flexible 
tool to model dependencies between random variables. This is particularly 
useful in the insurance sector, where the dependence structure between the 
different risks is of greatest interest. 

The definition of a copula is surprisingly simple [8]: 

Definition 2.1 (Copula) A d-dimensional copula is a distribution function on 
[0, lY with standard uniform marginal distributions. 

We reserve the notation C(m) = C(wi,..., Wi) for the multivariate distribution 
functions that are copulas. Hence, C is a mapping of the form C : [0,1]“^ —> 
[0,1], i.e. a mapping of the unit hypercube into the unit interval. 


The importance of copulas is justified by Sklar's Theorem: 

Theorem 2.2 (Sklar 1959) Let F be a joint distribution function with margins 
Fi,..., F^. Then there exists a copula C : [0,1]“^ —> [0,1] such that, for all xi,...,Xd 
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in R = [— 00 , 00 ], 


F(xi,...,Xrf) =C(Fi(xi),...,F,(xrf)). (2.1) 

If the margins are continuous, then C is unique. 

Conversely, if C is a copula and Fi,...,Fd are univariate distribution functions, 
then the function F defined in (2.1) is a joint distribution function with margins 
h,- ■■ ,Fd. 

Sklar's Theorem can be interpreted in the following way: 

1. The first part of the theorem states that we can decompose any distri¬ 
bution function into its margins and a copula. This allows us to study 
multivariate distributions independently of the margins. 

2. The second part of the theorem states that copulas together with margi¬ 
nal distribution functions can be used to construct new multivariate 
distributions. 


2.2 The aggregation tree model 

In the Introduction we presented the idea of hierarchical risk aggregation, 
and we learned that this method is graphically best represented by a so 
called aggregation tree. It is our aim to introduce a couple of definitions and 
notational conventions in order to describe the hierarchical risk aggregation 
model in an sound mathematical way. We will closely follow the terminol¬ 
ogy suggested by Arbenz et al. [1]. 

Rooted tree 

It seems reasonable to adopt part of the terminology from graph theory to 
describe the tree structure. We will, therefore, first introduce the notion of 
a rooted tree. A rooted tree consists of branching nodes and leaf nodes, with 
one distinct branching node being the root. The root node is denoted by 
0 and the other nodes by tuples (h,.. .,id) of natural numbers ij G N. If 
a node (z'l ,... ,id) G is not a leaf node, it branches into a number of 
^ ^ children, which are denoted by the {d -|- l)-tuples (z'l,..., id, 1 ), 
ih, ...,id,2),..., {h,..., id, .) G N‘^+1. 

Definition 2.3 A finite set r C {0} U denotes a rooted tree if 

1. the root 0 is contained in r, 

2. for each node I = (z'l ,... ,id) G t, the number of children is given by 
Ni G No, i.e. the node I has a child (ii,... ,id,k) G r if and only if 
A:G{nGN:l<n< Nj}, 
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3. each node (z'l,..., G t has a parent node represented by (z'l, ..., id-i) S 

T. 

Example 2.4 Let r = {0, (1), (1,1), (1,2), (2,1), (2,2), (2)}, as illustrated 
in Figure 2.1. Note that r defines a tree of the structure as used in Example 1.1. 


( 2 , 2 ) 

Figure 2.1: An illustration of the rooted tree t = {0, (1), (1,1), (1,2), (2,1), (2,2), (2)}. 

We define the following subsets for every rooted tree t: 

1. Leaf nodes are denoted by .if(T) = {J G t : Nj = 0}. 

2. Branching nodes are = {1 G t : N; > 0} = t \ .if (t). 

3. Children of node I = {ii,...,id) ^ 1^{t) are defined as ^(1 ,t) = 

{{lA) . ilNi)}. 

4. Descendants of node I = (z'l,...,)^) G t are defined as ^(J,t) = 
Uh'-'-Jk) G T : /c > d. Hi,...,id) = {ii,...,id)}- 

5. Leaf descendants of node 1 G t are defined as .^S>{I,t) = .if(T) n 
^(1 ,t). 

6 . The number of leaf descendants of node J G t is denoted with M; = 
#.^^(J,t). 

Aggregation tree 

This section introduces a risk aggregation approach building upon a given 
rooted tree t. We define on some probability space (TT, . 2 /, P) a random 
vector {Si)ieT that assigns to each node J G t a random variable Xj : O —f R 
such that 

• for leaf nodes I G -Sf(T), the Xj represent the risks whose aggregate 
we are interested in, 

• branching nodes Xj, I G =^(t), are given by the aggregation of their 
children: X; = X/1 + ... + X/^Nj . 

In the following, we will further use the convention that whenever we write 
Xi, I G .^(t), in bold letters we mean the random vector defined by 

X: := (Xj)igjf'^(j^T-)/ 
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with components Xj, for / G where the components are lexico¬ 

graphically ordered with respect to their indices /. 

For ease of notation, we drop brackets of index vectors as well as the argu¬ 
ment T whenever the meaning is clear. For instance, = Xi i, = 

^(J) and ^^((1,1), t) = ^^(1,1). 

The rooted tree defined in the previous section represents the hierarchy in 
which the individual risks are aggregated. In order to complete the aggre¬ 
gation tree model we additionally need to define marginal distributions F/, 
I G -S^’(t), and the dependence structure between the children of a branch¬ 
ing node I. The dependence structure is given through copulas C/, I G ^(t). 

In summary, the aggregation tree model is given by the triple 

(''■/ (C)/6 ^(t)) (2 .2) 

consisting of 

• a rooted tree t; 

• distribution functions F/ : R —> [0,1] for all I E 

• copulas Cl : [0,1]^^ —)■ [0,1] for all I G 

Example 2.5 In our lead Example 1.1, the triple (2.2) would incorporate the tree 
T = {0, (1), (1,1), (1,2), (2,1), (2,2), {2)}; four univariate distribution func¬ 
tions Fi^i, Fi^ 2 , ^ 2,1 ‘^nd F 2 , 2 ; and three bivariate copulas Ci, C 2 and C 0 . 

Mildly tree dependent and tree dependent random vectors 

Based on the triple (2.2), we characterize a class of distributions for a random 
vector {X)ifZT^. 

Definition 2.6 For a given triple (2.2), (X)/gT- is called mildly tree dependent if 
the following three conditions are met. 

• For each leaf node I G the random variable Xj has a distribution Fj; 
Fi{x) = P[Xi < x] for all x G R. 

• For each branching node 1 E Xi is the sum of its children, i.e., Xj = 

The marginal density function ofXj is denoted by Fj : R —> [0,1]. 

• For each branching node 1 E 1^, the dependence structure of its children ^ is 
given by the copula Cj, i.e. 

< ^1, • • •, X;,Nj < ^Nj] = Ci{Fi^i{xi),. . . ,Fi^i^j{X]sij)), 

for all (xi,.. .,xnJ G R^G 
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The following example illustrates a mildly tree dependent random vector. 


Example 2.7 Consider the aggregation tree model yr, (f7)j6^(T)/ (C) 76 ^(t) j /rom 
our lead Example 1.1, which was specified in Example 2.5. Suppose (X;)jgT: = 
(X 0 , Xi, X 14 , Xi^ 2 / ^ 2 , 1 / ^ 2 , 2 / X 2 ) is mildly tree dependent for the model 




P[Xl,l <x]= P[Xi ,2 < x] = Fi, 2 (x), 

F[X2,1 < x] = F2,i(x), F[X2,2 < x] = F2,2{x), X G K. 


Recall that X]^ — Xi^i T" X 2 ^ 2 / X 2 — X 2 ^i ^ 2,2 and X 0 — X^^ -t- X 2 . Ehe copulas 
Cl, C 2 and C 0 determine the dependence structure of (X 14 , Xi^ 2 ), (X 2 ,i/ ^(. 2 , 2 ) t^nd 
(Xi,X 2 ), respectively: 

P[^l,l < Xi,Xi^2 < X 2 ] = Ci(Fi4(Xi),Fi^2(X2))/ 

^’[X2,l < Xi,X2,2 < X2] = C2(F24 (Xi),F2,2(X2)), 

F[Xi < Xi,X2 < X2] = C0(Fi(xi),F2(X2)). 


Recall that Figure 1.1 illustrates the aggregation structure. 


By Sklar's Theorem 2.1 it should be clear that in the above Example 2.7 
the distribution of the random vectors (Xi i, X 12 ), (X 24 ,X 2 , 2 ), (Xi,X 2 ), and 
hence also of the aggregated risk X 0 = Xi + X 2 , is uniquely specified 
through the aggregation tree model (t, (Fi)jg_ 5 ^(T-), (C)jg^(T-)). We will state 
a general result in the next section. 

On the other hand, as mentioned already in the Introduction, the joint dis¬ 
tribution of all risk (Xi i,Xi^ 2 /X 24 ,X 2 , 2 ) is not uniquely specified. Put dif¬ 
ferently: given a aggregation model (t, {Fi)i^^(r)f there exists in 

general more than one mildly tree dependent random vector (Xj/g^- 


Arbenz et al. [1] propose an additional condition which makes the joint 
distribution unique. The condition is formulated in the following definition. 


Definition 2.8 For a given triple (2.2), a mildly tree dependent random vector 
(XjigT is called tree dependent if for each branching node 1 G =^(t), given Xj, 
its descendants (Xjjjg^p) are conditionally independent of the remaining nodes 

(X/)/6t\^(7)- 

(X/)/ 6 ^( 7 )-L(Xj)|gT-\^(j) I X; for all I e ^{t). (2.3) 

In the sequel, (2.3) will be referred to as the conditional independence assumption. 

Example 2.9 Considering once again the aggregation model from our previous ex¬ 
amples, the conditional independence assumption would read: 

(Xi,i,Xi,2,Xi)±(X24,X2,2,X2,X0) I Xi, 

(X2,l, X2,2, X2 )-L(Xi4,Xi^2/Xi,X0) I X2. 
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One can show that under this additional condition, a tree dependent random vector 
exists and is unique. We will state the general result in the next section. 

The conditional independence assumption (2.3) may seem quite confusing 
at first glance, and it is questionable whether it is a reasonable assumption 
in practice or not. We will come back to that later on in Section 4.3. 

2.3 Existence and uniqueness 

In this section we address existence and uniqueness of the distribution of 
tree dependent and mildly tree dependent random vectors. 

Recalling again the definitions of tree dependence and mild tree dependence, 
it is clear that mild tree dependence covers a larger space of multivariate 
distributions than tree dependence. Any tree dependent random vector is 
mildly tree dependent, whereas the converse does not hold. 

Given an aggregation tree model the existence 

of a mildly tree dependent random vector (X)iQr is fairly obvious by Sklar's 
Theorem 2.1. And we have already mentioned before that in general the 
distribution of a mildly tree dependent random vector is not unique. Con¬ 
sidering the smaller class of tree dependent random vectors existence is non- 
obvious. Arbenz et al. [1] proved that among the mildly tree dependent 
distributions there exists indeed exactly one distribution that satisfies (2.3): 

Theorem 2.10 Given an aggregation tree model (2.2), a tree dependent random 
vector (X)iQr exists and its joint distribution is unique. 

Proof See [1] p. 125. □ 

We also state the following result regarding an important property of mildly 
tree dependent random vectors. 

Proposition 2.11 Given an aggregation tree model (2.2) and some fixed 1 E 
Then the distribution of the vector (X 74 ,... ,X/^Nj) Is equal for all mildly tree de¬ 
pendent vectors (X)jQT^. 

Proof Simple application of Sklar's Theorem 2.1. See [1] p. 126. □ 

In particular, by Proposition 2.11 the distribution of the total aggregate X 0 
is the same for any mildly tree dependent random vector (X) 7 gT- 
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Chapter 3 


The sample reordering algorithm 


As in most cases the quantities of interest (e.g., the distribution of X^) of an 
aggregation tree model cannot be calculated analytically, Arbenz et al. [1] 
propose a sampling algorithm for the numerical approximation of mild tree 
dependence. The samples obtained by the algorithm can then, for instance, 
be used to approximate the distribution of the total aggregate X 0 . 

In this chapter we first present the sample reordering algorithm proposed 
by Arbenz et al. [1]. We then state some convergence results and conduct a 
numerical example at the end of this chapter that raises new questions and 
motivates the subsequent chapters. 

3.1 Definition of the algorithm 

The sampling algorithm by Arbenz et al. [1] uses a bottom-up approach to 
achieve a numerical approximation of mild tree dependence. The basic idea 
can be summarized as follows: 

1. Simulate independent marginal samples for all leaf nodes X;, I G 

2 . simulate independent copula samples from C/ for all branching nodes 
f G 

3. for f G recursively define approximations of the distributions of Xj 
and (X/ 4 ,..., Xj^nJ based on the empirical margins and the empirical 
copulas from steps 1 to 2 . 

For ease of notation, let Gj : > [0,1] for I G ^ denote the joint distribu¬ 
tion function of (Xj^,..., the children of Xj: 


Gi{xi,. . . ,XNj) = P[^I,l < Xi, . . Xj^Nj < Xn,] 
= C/ (Fj 4 (xi ), . . . , {xNi ) ) • 
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Algorithm 3.1 Fix n G N 

1. Generate n independent samples from the leaf nodes X;, I G Jf, and 
the copulas Cj, I E ^ 

• ^ FiJork = l,...,n, 

• Iff = Uf^') ~ Cj, for = 1,, n. 

2. Define empirical margins F” : R —> [0,1]/ f £ by 

f;’w = jEi{xf<*)' 

and empirical copulas C” : [0,1]^^ —> [0,1], I E by 
1 ^ 

Cj{u) = - 'I, for M = (mi,. . .,mn,) e [0,1]^F 

^k=i . ^<unA 

where rY is the rank of UY within the set | uY | : 

3. For I E SS, recursively define as approximations of Gj and Fj: 

G?(xi.XN,) = C?(F,^i(xi). F,%(xn,))z 

for (xi,... ,Xnj) G R'^ and 

F”(f) = f l{xi + ... + XjVj < f}dGj (xi,... ,Xfvf;)/ 

JR^l 

for f G R. 

It turns out that applying empirical copulas to empirical margins, as used 
in the definition of the G'f, can be efficiently represented in terms of sample 
reordering. The idea goes back to Iman and Conover [6] and was adapted 
by Arbenz et al. [1] to this context: 

Theorem 3.2 In the following, a permutation denotes a bijective mapping from 
{1,2,...,n} to {1,2,...,n}. 

• Let the permutations pi for I E t\0 be defined through Pii{k) = R^f, 
k = l,...,n. 
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Recursively define samples X\, k 


x;= E X 


(P/W) 


= 1, ... ,n,for 1 E ^hy 

_ y(PuW) I I y(PPN,W) 

— Aj j + . . . + Aj , 


w/zere X® 


denotes the k-th order statistic o/ |x|,. 

x}^^ < xf... < x\"\ 



Then Gj,for I E and Ff,for I E r, satisfy 


G;(xi,...,xn,) = ^ El 


Api,m) 


1 Ki SXiforall i=l,...,N, 




k=l 


(3.1) 

(3.2) 


Proof See [1] p. 127. 


□ 


Algorithm 3.1 can now be formulated in terms of sample reorderings. We 
start with simulating leaf nodes and copulas independently, thus obtaining 
samples X\, k = 1,... ,n, for 1 E and U\, k = 1,... ,n, for I E We then 
use the permutations pij to link the appropriate order statistics and define 
the atoms of G" by 


( Y^Pl, ylP/.NfW) 
1 ^ 7,1 ' • • • ' ^I,N, 


k = l,...,n 


(3.3) 


Hence, the permutations pi can be used to introduce the correct dependence 
structure into originally independent samples by joining the appropriate or¬ 
der statistics. For branching nodes I E .AS, the component-wise sum then 

defines the atoms of Fj as X\ = We iteratively 

repeat this reordering from the bottom to the top of the tree. An illustrating 
example can be found in [1]. 


We will later on propose an alternative way to reorder the samples. The 
alternative way seems less natural and is notationally less comprehensible 
but yields exactly the same atoms, so that the statement of Theorem 3.2 still 
holds true. In addition, it allows us to draw some conclusions regarding 
the joint distribution of the mildly tree dependent vector approximated by 
our modified algorithm. Having said that, we go on by discussing some 
convergence results in the next section. 


3.2 Convergence result 

In this section we state the main convergence result on Algorithm 3.1. We re¬ 
strain ourselves, at this place, to provide an in-depth deduction and analysis 
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of the following result, and we refer to Mainik [7] for a detailed deduction 
in a much more general setting. 


In order to formulate the results, we need some additional notation. For 
continuous margins and f G ]R define Bj{t) G [0,1]^' as 

: (^1/- • e IR^hxi + ... + X]si, = f| 

Let Us{Bi{t)) denote the ^-neighbourhood of Bj(f) in [0,1]^': 

Us{Bi{t)) = |a G [0,1]^^ : ||a — 1/II2 < S for some y G B 7 (f)| . 

For absolutely continuous copulas C/, we denote the density by Ci : [0,1]^' —)■ 
[0,oo). 

Theorem 3.3 Assume that for all I ^ the copulas Cj are absolutely continuous 
and satisfy 


lim sup / Cj(mi, ..., W]v,)dMi ■ ■ ■ diijv/,= 0. 

tgR M(B,(f)) 

Then, for each branching node 1 E 


lim sup IG; (a) — G/(a) I = 0 P — a.s, 

xeR’^i 


lim sup|Ff”(f) 


Fi{t) \ =0 P — a.s. 


Proof See Mainik [7]. □ 

It is crucial to grasp the meaning of Theorem 3.3 correctly. Theorem 3.3 does 
not make any statement whether the reordered samples from Algorithm 3.1 
approximate a (mildly) tree dependent random vector. Instead, it simply 
tells us that as long as the conditions on the copulas C/ are satisfied, the 
samples approximate the distributions Gj, I E which characterize all 
mildly tree dependent distributions. In particular, the samples approximate 
the distribution F 0 of the total aggregate X 0 . 


3.3 Numerical experiment 

In the previous sections we introduced the hierarchical aggregation tree 
model. So far, papers on this subject focused on the total aggregate X 0 as 
the quantity of interest. Arbenz et al. [1] proposed the reordering Algorithm 
3.1 (incl. Theorem 3.2), which yields samples that approximate the distribu¬ 
tion of the total aggregate (recall Theorem 3.3). Unaddressed remained the 
question whether the samples actually approximate a mildly tree dependent 
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or even tree dependent random vector. 

In the following we will now shift our focus towards this relevant question. 
We emphasize that what follows has not yet been addressed in any previous 
paper and goes beyond the current theoretical results on this subject. 

We start the discussion with a numerical motivational example. The result 
of the example is surprising and gives a first hint what the answer to our 
question might look like. 

Defining the model 

To start with, the aggregation tree model (f7)j6^(T)/ {C)ie^{T)^ needs to 
be specified. We consider once again the situation from our lead Example 
1.1. By Example 2.5, we know that in this set up the aggregation tree model 
incorporates the tree T = {0, (1), (1,1), (1,2), (2,1), (2,2), (2)}; four uni¬ 
variate density functions F14, Fi,2, Fi,! and ¥ 2 ^/ and three bivariate copulas 
C\/ C 2 and C0. 


Suppose Fi 1, Fi 2, F24 and F2,2 are given by normal distributions with means 
4,2,0,3; and variances 3, 4, 10, 2, respectively: 

Suppose further that Ci, C2 and C0 are normal copulas with correlation 
matrices 


Ri 


1 0.7 \ 

0.7 1 J ' 


R2 


1 0.5 \ 

0.5 1 J' 


R0 


1 0.2 \ 

0.2 1 y ■ 


Note that in our model all leaf nodes are normal, and all copulas are normal 
copulas. Hence, we are dealing with a Gaussian aggregation tree model. 
Gaussian aggregation tree models have the convenient property that the 
unique tree dependent vector (X/)/^!- is multivariate normal distributed and 
can be explicitly calculated by a simple recursion formula (see Proposition 
2.8 in [1]). 

For the tree dependent random vector (Xj)7gT, we deduce by Proposition 2.8 
in [1] that the distribution of X0 = = (Xi,i, Xi^2, X24, X2,2) ~ 

N{}i 0 ,T, 0 ) is multivariate normal with 





( 3 

2.4249 

0.9502 

0.3290 \ 


2 


2.4249 

4 

1.1254 

0.3896 

F0 — 

0 

, Tj0 — 

0.9502 

1.1254 

10 

2.2361 


V 3 ) 


\ 0.3290 

0.3896 

2.2361 

2 / 


17 



3. The sample reordering algorithm 


Numerical result 

We will now apply the reordering algorithm on our aggregation tree model 
specified above. Let be the tree dependent random vector and note 

that by the convergence result stated in Theorem 3.2 we know that the re¬ 
ordered samples approximate the distributions of 

. Xi = (Xi,i,Xi,2) and X2 = (X2,1,X2,2); 

• X\ = \ Xi 2 and X 2 — X 2 ^i -I- X 2 ^) 

• (Xi,X2) and the total aggregate X0 = Xi -|- X2. 

Still unclear is whether the samples provide an approximation of the tree de¬ 
pendent random vector characterized by the distribution of X0 ~ X0). 

In order to investigate this question, we compute the sample covariance 
matrix £0 of the reordered samples for a sample size of n = 10^: 

/ 2.9985 2.4252 0.9513 

^ _ 2.4252 4.0025 1.1278 

^ “ 0.9513 1.1278 9.9978 

V 0.3301 0.3909 2.2337 

It is remarkable that the sample covariance £0 of the reordered samples and 
the covariance matrix X0 in (3.4) of the tree dependent random vector are 
almost identical. This is a strong indication that in our example the reorder¬ 
ing algorithm yields samples that approximate the unique tree dependent 
random vector. 

To back up our hypothesis, we also conduct a multivariate normality test. 
Recall that by Proposition 2.8 in [1] the tree dependent random vector is 
multivariate normally distributed. There are many test for assessing mul¬ 
tivariate normality, we choose to apply "Henze-Zirkler's Multivariate Nor¬ 
mality Test" [10]. At the 0.05 significance level the test does not reject the 
null-hypothesis of multivariate normality, and the p-value associated with 
the Henze-Zirkler statistic is 0.7848365. 

The numerical results suggest that the reordered samples approximate the 
unique tree dependent random vector. Naturally, this begs the question 
whether this holds true in general or everything was just coincidence. We 
will address this question in the next chapter. 

Concluding remark 

One might wonder if the above questions are actually relevant. Do we really 
need to bother ourselves with characterizing the multivariate distribution 
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approximated by the reordered samples? Is it not enough to know that the 
total aggregate X 0 is approximated correctly? 

Consider, for instance, the above example and recall that in Example 1.1 we 
said that Xi 1 stands for "Car insurance Switzerland", Xi^2 stands for "Car 
insurance Italy", X24 stands for "Earthquake Switzerland" and X2,2 stands 
for "Earthquake Italy". Assume an insurance company would not be just 
interested in the total aggregate but also in the total risk they are taking 
in Switzerland, i.e. Xi 1 + X24. This distribution is not uniquely specified 
by the aggregation tree model illustrated in Eigure 1.1, and so we most 
likely wish to have certainty about which distribution is approximated by 
a sampling algorithm. We see that it is definitely worthwhile to clarify the 
questions that have arisen in this section. 
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Chapter 4 


Tree dependent sampling 


We have seen in the end of the previous chapter that some issues regard¬ 
ing the reordering algorithm have not yet been addressed sufficiently. In 
particular, it is not clear whether the samples approximate the unique tree 
dependent distribution or some other distribution that remains to be deter¬ 
mined. The numerical experiment in Section 3.3 suggests that indeed the 
tree dependent distribution is approximated. 

In this chapter we propose a modification of the reordering algorithm for 
which we claim that for discrete marginals we obtain approximations of the 
tree dependent distribution. In addition, our modified algorithm has the 
convenient property that it generates i.i.d. realizations; however, at the cost 
of a worse algorithmic efficiency. 

We begin the chapter by presenting the modified algorithm. The proof of 
the above claim will then be the subject of the second part of this chapter. 


4.1 A modified reordering algorithm 

Based on Arbenz' reordering algorithm (Algorithm 3.1 & Theorem 3.2) we 
propose a modified reordering algorithm (MRA). The modifications will 
play an important role in the proof of Theorem 4.5, which we state in the 
next section. 

The first modification concerns the order the realizations are linked to each 
other. Have a look at the following Example 4.1 where we illustrate what 
we mean by this. 


Example 4.1 Consider the simplest tree r = {0, 1, 2}, with two margins Xi ~ 
Fi, X 2 ~ F 2 and a dependence structure given by a copula U 0 ~ C 0 . Suppose the 
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simulation for n = 3 yields the following Xf X\ and U%: 

X| = 1, X? = 4, Xf = 2, 

Xi = 9, Xl = 0, X3 = 3, 

Ul = (0.6,0.8), Ul = (0.3,0.7), Ul = (0.5,0.1). 


Recall that we denote by rY the rank of uY within the set 
of the copula samples are 


I Uf I . The ranks 


( 4 '. 4 ') = ( 3 , 3 ), 


(R^/,R^/) = (1,2), 


(R^/,R^/) = (2,1). 


With Pii{k) := rY the permutations are 

P 04 : (1,2,3) ^ (3,1,2), p 0,2 : (1,2,3) ^ (3,2,1). 

Furthermore, we denote by (resp. the rank of Xj (resp. X^l within the 
set (resp. and define the permutations q 0 ,i{k) := 

q0,2{k) '■= Q^Y- 


^ 0,1 ■ (1/2,3) I—7- (1,3,2), q 0^2 ■ (1/2,3) 1 —> (3,1,2). 

We will also use the notation of the inverse permutations P 0 \, pYi' ^ 0 \ ^ 02 - 


In the following we present two different reordering orders. We start with the more 
natural one proposed by Arbenz et al. [1] in Theorem 3.2. 


Reordering 1 (Arbenz) Let r^X^, ReiX|„ ReiX|, denote the reordered vectors. 
The subscript "Rel" indicates that "Reordering 1" was applied. Following the 
reordering order from Theorem 3.2 we get 


Rel 




^(P0,iW) 


Al 

A 2 


fork = 1,.. .,3. 


The figure below illustrates how the samples are linked to each other (X\ top row, 
U% middle row, X 2 bottom row): 



Reorcjering 1 


x' = 

Re1'^0 



Re1 





2 

0 
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Reordering 2 In the above "Reordering 1 " we linked the appropriate order statis¬ 
tics to the ranks associated with the components of U% in order to obtain ReiX%. 
This procedure is illustrated by the arrows pointing from U% towards the appropri¬ 
ate marginals. 

An alternative way to reorder the samples would be to set the first component of 
Re2X% equal to Xj and then link the appropriate second component to it: 



Reordering 2 


x' = 

Re2'''0 



Re2 





Note how we changed the direction of the arrows in order to illustrate the difference 
between the two reordering orders. Mathematically, Re2X% can be written as 

Re2X0 := ^ ^qf-;i{v0.2{p0\{qiN))) ^ for k = 1 ,... , 3 . ( 4 . 1 ) 

It is clear that both reordering orders yield the same atoms, i.e. the sets 
{ReiX0,Rrt X%,Rei X%} and {Re2X0,Re2 X%,Re2 X%] Contain the same elements 
but in general in a different order. As a consequence, the "modification" has 
in fact absolutely no impact on the convergence result in Theorem 3.3. 

The second modification may be seen as the major modification. It aims 
at changing the algorithm in such a way that it yields i.i.d. realizations. 
Let us focus for a moment on the random vectors Re2Xf,Re2 X%,Re2 X% in 
( 4 . 1 ) and note that these are indeed random vectors consisting of the i.i.d. 
random variables XfXfXf ~ Fi and X2,X2, ~ Tz- By a simple symme¬ 

try argument it is clear that Re2X%,Re2 X^,Re2 X^ are identically distributed. 
They are, however, as Arbenz et al. pointed out already correctly, not inde¬ 
pendent, and hence the reordered random vectors Re2X%,Re2 X%,Re2 X% are 
not i.i.d.. 


To get around this limitation we could proceed as follows: Conduct the 
algorithm three times in order to obtain three independent copies of the re¬ 
ordered samples. We denote the first copy by |Re2^0/Re2X^, l^e 

second one by {rJX^,j^JX0,rJX 0}, and the third one by {rJX^,kJX^, 

Re2^l} and set 


Re2X0 Re2^0/ 


2 2 ^ 
Re2X0 := Re2^0/ 


R. 2 X 3 := j,lxl. ( 4 . 2 ) 
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The so defined reordered random vectors Re 2 ^ 0 fRe 2 ^%,Re 2 are now clearly 
i.i.d.. Note, however, that this comes at the cost of a worse algorithmic effi¬ 
ciency 


Having discussed the two major modifications, we are now ready to for¬ 
mulate and understand the MRA. Unfortunately, the algorithm becomes no- 
tationally extremely complex due to the mentioned modifications. Instead 
of struggling with the notation, we suggest to rather focus on the general 
idea presented in the previous discussion and to have a look at the explana¬ 
tory Remark 4.3. 


In the following, a permutation denotes a bijective mapping from {1,2,..., n} 
to {1,2,...,n}. 

Algorithm 4.2 (Modified reordering algorithm - MRA) Fix n G IN. 

The samples are recursively defined from the bottom to the top of the tree. 

1. Generate n x n independent samples from the leaf nodes Xj, I G .if, 
and the copulas Ci, I E ^ 

. ^Xj~F;,for G {l,...,nf. 


. = {£,k) G 

Recall that we denote by ^rY the rank of ^UY within the set . 


Let for J G T \ 0 the permutations ^pi, I = 1,.. .,n, be defined through 
^Pi,iik) = ^rY> k = 1 ,... ,n and denote by ^pjl{-) their inverses. 

2. Recursively define for £ = 1,... ,n samples ^Xj, k = 1,... ,n, I G by 




— ^ 1,1 + • • • + 


(4.3) 


and samples by 

k_ (i^k eYA,2(‘piA‘pY,liAiAA))} 


£A.i — ( A.JI, A.J2 


I,Ni 


(4.4) 


where the permutations ^qj are defined through ^qj{k) = ^Qj and 


denotes the rank of ^X, within the set 


/ 


{Ni;, 


3. For k = 1,... ,n set 


vk _ vk 

^1 ■— k^lf 

X k _ \rk 

I ■— k-^I- 
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4.2. Convergence results 


4. If we haven not yet reached the root 0, repeat the algorithm up to this 
point n different times to generate n independent copies 

• of the reordered samples 

V J V J fc— 1 

• ^ of the reordered samples I Xj|^ 

5. Repeat steps 2-4 till you reach the root node 0 of the tree. 

Remark 4.3 

1. In step 2 of the MRA, for fixed i, we simply define recursively the reordered 
samples. The reordering order is chosen in such a way that the first component 
of the reordered samples is fixed (compare "Reordering 2" in Example 4.1) 

2. Since £ is running from £ = 1,... ,n, we obtain n independent sets of re¬ 
ordered samples. In step 3 we use a diagonalizing procedure similar to (4.2), 
which yields n i.i.d. samples. 

3. In case we have not yet reached the root node, we need n independent copies 
of these i.i.d. samples, so that we can continue with the next recursion step. 
These independent copies are generated in step 4. 

4.2 Convergence results 

In this section we discuss the convergence properties of the MRA. First, we 
show in Subsection 4.2.1 that the results in Theorem 3.3 are also true for 
the MRA. In addition to this, the MRA allows us to prove a result that goes 
beyond the ones already existing. We state and prove this convergence result 
in Subsection 4.2.2. 

4.2.1 The basic convergence result 

Due to the modifications carried out it is not a priori clear that the conver¬ 
gence result in Theorem 3.3 are still true for the MRA. We mentioned in the 
previous section that the first modification - namely the change of reorder¬ 
ing order - does not affect this result, since we obtain the same reordered 
samples (possibly in a different order). 

The second and more significant modification may, however, affect the re¬ 
sult. Fortunately, we can show that this is not the case when we additionally 
assume that the marginals are discrete (finite or infinite). This will be the 
subject of this section. 

Let in the following Gj and F", £ = 1,... ,n, for 1 G denote the empirical 
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distribution functions defined through 


1 ” 

..N,}' 

L"(^) = - T^{xf<*}- 

Jc=l *- -■ 


(4.7) 

(4.8) 


Since especially the first expression looks confusing, we want to empha¬ 
sis that these two expressions are nothing else than empirical distribution 
functions of the MRA-reordered samples (analogously to the expressions 
(3.1)&(3.2) in Theorem 3.3) 


Theorem 4.4 Assume that the conditions on the copulas Cj, f G as formulated 
in Theorem 3.3 are satisfied and that all the marginals are discrete. Then, for each 
branching node I E 

lim sup IGj (a) — Gj(x) I = 0 P — a.s., (4.9) 

lim sup \ Ff{t) — Fi{t) \ = 0 P — a.s. (4.10) 

teR 


The following auxiliary results will help us to prove Theorem 4.4. 

Auxiliary result 1 Let X„ : Q —> R, n G N, and X : Q ^ R be discrete 
random variables such that X„ X in distribution. Denote by ranX = 

{X(a;)|cu G Q} the range of X and assume there exists a discrete set E C R 
such that ranX C E and ranX„ C E, Vn G N. 

Then it holds that lim P\X„ = xl = P\X = x], Vx G R. 

n^co 

Proof Let first x G ranX. For a left-open, right-closed interval U = {a, b] it 
holds that 


lim P[X„ eU]= P[X G U] 

n^oo 

by convergence in distribution. Since E is discrete we can find U = {a, b] 
such that a < X < b and E Dll = {x}. Then it holds that 

P[X„ G G] = P[X„ = x], P[Xg G] =P[X = x]. 

If X ^ ranX we argue similarly. Find G = {a, b] such that a < x < b and 
E n G = 0. Then 0 = lim P[X„ = xl = lim P[X„ G Gl = P[X G Gl = 

n^oo jn-oo 

P[x = x] = 0. □ 
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Auxiliary result 2 Let X and X„, n G N, as in the 1®* auxiliary result above. 
Denote their distribution functions by F, respectively F„, and the associated 
measures by }i, respectively }i„. Then the convergence in distribution is 
uniformly in the sense that 

lim sup |F„(x) — F(x) I = 0. (4.11) 


Proof Let e > 0. Since X is discrete there exists a finite set A C E, of 
cardinality |A| < oo, such that P[X G A] = }i{A) > 1 — e. Choose Ni G N 
large enough such that Vn > Ni: |/^„(R\A) —/^(R\A)| < e. Then the 
triangle-inequality implies that Vn > Ni: 

|/in(R\A)| <£+|;^(R\A)| <2e. 

By the 1®* auxiliary result above we know that lim Unix) = lim P\X„ = 

n^oo n^oo 

x] = P[X = x] = }i{x), for all x G A. This, and the fact that A is finite 
implies that there exists N 2 G N such that Vn > N 2 : \pn{x) — ji{x)\ < e, 
Vx G A. 

Let now x G R be arbitrary and set B := (— 00 , x] fl A and := (— 00 , x] \ A. 

Then Vn > max {Ni, N 2 }' 

\Fn{x)-F{x)\ = \M^) + 

< \}in{B) - + \ii„{B^) - f(B‘')| < |A| e + 2£. □ 


Auxiliary result 3 Let X and X„, n G IN, as in the 1®* auxiliary result above. 
Denote their distribution functions by F, respectively F„. Let Fm,n{x) := 

m 

4{x',<x}/ G N^, denote the empirical distribution function of the 

1=1 ~ 

i.d.d. random variables X^,..., X^ ~ F„. Then 

lim sup |F„,„(x) — F(x)| = 0 P — a.s.. (4.12) 


Proof We write \\Fm,n — F|loo := sup \Fm,n{x) — F(x)| for ease of notation. 

a:6R 

Note that the Glivenko-Cantelli Theorem implies that lim IlFm « — F„ ||oo = 0 

m^co ' 

P-a.s., for all n G N. By the 2 ^'^ auxiliary result above we know that V/c G N 
there exists N(/c) G N such that 

||F„-F|U<p Vn>N(/c). 
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Using these two facts, we get that for all n > N{k): 

1 

lim ||-Fj77 — lim -fnlloo T lim \\Fn f"||oo ^ y P u.s.. 

m^oo m—>00 m^oa /C 

'->,-' '-V-" 

^ 0 P-a.s. < i 

— k 

So lim \\Fn,n — F\\ CO < 1 P-a.s. Letting A: —> 00 then proves the assertion. □ 

n>N{k) 

Proof (Theorem 4.4) The proof of Theorem 4.4 is obtained by induction 
from the bottom to the top of the tree. The Ff for leaf nodes I E ^ are 
obtained through a simulation from the true distributions F;. Therefore, the 
Glivenko-Cantelli Theorem directly yields (4.10) for 1 G 


Fix J G Suppose that (4.9) and (4.10) hold true for (J, 1),..., (1, Nj). 
Recall the recursion procedure in step 2 of the MRA together with the sec¬ 
ond remark in (4.3) and note that the MRA yields n independent sets of 
random vectors 


.notu.d, 

...» ~ Gf not i.i.d. 


(4.13) 


Note In practice, the MRA yields realizations of these random variables. 
Concretely, realizations of {i,k) G {1,..., n}^, are given by 

i^{k:n) _ (1^^ iY‘l7,2FPi,2Fp7,l(‘ti,i(k)))) £ ^‘]7 n/pi.n,{‘p7iF‘1iA’^))))\ 

Yj .— I Aj2 ,..., I. 

To avoid that the complicated form distracts us from the idea of the proof, 
we decided to mention this just as a side note. 

We stress that all G (£,k) G {1,.. ■ ,n}^, indeed have the same 

distribution, which we denote by Gj". The *n is used to distinguish the 
distribution function Gj» from the empirical distribution function G». 


Denote by ^G», £ = 1,... ,n, the empirical distribution function of the identi¬ 
cally but not independently distributed random vectors 

These random vectors ^Yf'"\... were reordered according to Ar- 

benz' reordering algorithm (except for the different reordering order). Hence, 
we can apply Lemma 3.6 in [1] in combination with the induction hypothesis, 
which tells us that for all £ = 1,..., n it holds that 


lim 

n^oo 


sup 


(^) ~ G/(^) 


= 0 P — a.s. 


(4.14) 
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Recall step 3 in the MRA and the related second remark in (4.3): The MRA 

picks the the diagonal elements in (4.13). We 

define 


._ 1 

^ I ■ — 



Hence, the MRA yields i.i.d. random vectors ~ Gj”. Note 

that the empirical distribution function of the random vectors ..., y|” ”^ 

is G'f from (4.7). Hence, it remains to show that G” converges to G/ in the 
sense of (4.9). 

(k'fi } 

We prove this by showing that Yj'^ Gj”, k = 1,... ,n, converges weakly 
towards Gi, i.e. we need to show that 

lim Gj"{x) = G[(x), Vx G at which Gj is continuous. (4.15) 


If (4.15) holds true, we can apply the 3''‘^ auxiliary result 4.2.1 which tells 
us that the empirical distribution function G” of the i.i.d. random vectors 
Yf""\... ~ G|” converges towards G/ in the sense of (4.9). 

Moreover, note that Xj ="sum of the components of Y) ' ", k = 1,.. .,n, 
and Ff is the empirical distribution function of the random variables X|,..., X” 
(recall 4.8). We can therefore apply the Cramer-Wold theorem and then ar¬ 
gue again as above with the 3’"'^ auxiliary result to prove (4.10). 


It remains to show that (4.15) is true, which we prove by contradiction: As¬ 
sume 3x G R^^ such that 3e > 0 s.t. Vn G N 3N{n) > n: |Gj^^”^(x) — 
G]{x) I > £. Fix an arbitrary £ G {1,..., n}. Then 


E[\Gi{x)-^G^^"\^ 


>|£[G,(x)-^Gf^”\x)]| 

> \E[Gi{x)]-EfG'^^"\x) 


N{n) 


This is a contradiction because ^Gj 
by dominated convergence £[|G;(x) — ^G^^^\x 


= |gkx)-g;^^”)(x)| >£ 

Gj{x) P-a.s. by (4.14) and so 


G 0. 


□ 
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4.2.2 Convergence towards the tree dependent distribution 

In the previous section we showed that in case of discrete marginals the 
MRA at least has the same convergence properties as the original algorithm. 
We will now go a step further and claim that in case of discrete marginals the 
MRA yields approximations of the unique tree dependent random vector. 





Figure 4.1: An illustration of the aggregation tree model (4.16). 


In order to keep the proof of this claim as comprehensible as possible we de¬ 
cided to restrict ourselves to one of the simplest tree structures and refer to 
the Appendix A where we discuss an extension to more general aggregation 
trees. More precisely, we will consider the aggregation tree model 

(t/ / (4.16) 

with T = {0, (1), (1,1), (1,2), (2)}; three univariate distribution functions 
Fi 1 , Fi 2 , F 2 of discrete random variables ; and two bivariate copulas Ci and 
C 0 satisfying the assumptions of Theorem 3.3. Figure 4.1 illustrates the 
aggregation tree model (4.16) for which we will prove the following theorem: 


Theorem 4.5 Consider the aggregation tree model defined in (4.16). Let = 
Xi^ 2 , X 2 ) be the unique tree dependant random vector associated with this 
model and denote the distribution function of X^ by Ft. 

Then, the MRA yields realizations of Ltd. reordered random vectors which all 
satisfy the conditional independence assumption (2.3) and additionally converge in 
distribution towards the unique tree dependent random vector X^. In particular, if 
Ff denotes the empirical distribution functions of these reordered random vectors, 
then it holds that 

lim sup IF”(x) — Ft(x)| = 0 P — a.s. (4.17) 

”^“s:eR3 
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Preliminaries 

From now on we will think of all the samples appearing in the MRA 4.2 not 
as samples anymore but as random variables. It should be clear that in prac- 



4.2. Convergence results 


tice the MRA then yields realizations (samples) of these random variables. 
Our aim will be to gain a better understanding of the expressions (4.3) and 
(4.4) in the 2^'^ step of the MRA. 

We stress first that the ranks of a set of i.i.d. random variables are as well 
random variables. Let for instance 7},... ,Z" he a sequence of i.i.d. random 
variables. Recall that the rank of within the set is defined as: 

7=1 

Expression (4.18) is clearly a random variable. Thus, (4.3) and (4.4) must as 
well be random variables. 


In the present compact form used in the MRA it seems difficult to talk 
about properties of the expression (4.4). We therefore propose a different 
representation of (4.4). The following short example illustrates the idea. 


Example 4.6 Let Z^,Z^ i.i.d. and denote by the rank ofZ^. Define the permu¬ 
tation p{k) = R^, k = 1,2. Let further V^,V^ i.i.d. be some random variables. 
Assume the random variables X^,X^ are defined as follows: 

Xi ;= pp(i)^ := 


Observe that we could alternatively write 

X^ = R^1{Z1<Z2} + ^^1{Z1>Z2}/ 
= l^^l{zi<z2} + ^^l{zi>z2}- 


(4.20) 


Obviously, (4.19) is a compact version of (4.20). Representation (4.20) has 
the advantage that it seems easier to analyse its distributional properties. 
Analogously to (4.20), we will now give an extended representation of 

W1 lY^'70,2{V0,2(V0,l{^fel(l)))) 

A.2 


for n = 2. This is expression (4.4) in our aggregation tree model (4.16) with 
t = 1, k = 1 and n = 2. We focus on the second component only: 


1^^'70,2(>0,2(>04(^'70,i(1)))) ^ 








+ ^X2l 


{ix;<ix?}n{iij^^<iu|^}n{iu^^<iij|"}n{ix^<ix^}- 
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4. Tree dependent sampling 


Note that in the above sum appear 2^ = 16 different indicator functions, 
and so the sum consists of 16 summands. In case of n = 3, the sum would 
already comprise (3!)^ = 1296 different summands. 

For the proof of Theorem 4.5 it is important to note that - no matter how 
large we choose n - we can always write the above sum as a random function 
f(}Xi) in^X^. For n = 2 we have, for instance. 


/(^) 






{;c<ix?}n{iu^^<iu|'}n{i(J^"<iu|"}n{ix^<ix^}- 

The random function f{x) consists of the random variables ^X^, ^X^, ^Xj, 

, ^X 2 , ^X 2 which are all independent of ^x\, and hence 
f{x) is independent of ^xj. 


Auxiliary result 4 Based on the 1®* auxiliary result 4.2.1 we state the fol¬ 
lowing extension: Let the discrete random vectors {X„,Y„) (X/ con¬ 

verge in distribution and assume again that there exists a discrete set E C 
which contains the ranges ran(X,;, ¥„) and ran(X, Y) . Fix {x,y) G K x ranY, 
then 


Jim E[ll{x„<x}|’'^n =y] = E[l{x<x}|’'^ = y]- 

Note that for the proof we can proceed analogously as in the proof of the 
1®* auxiliary result 4.2.1. We do not want to dwell on this any further and 
instead proceed with the proof of our main Theorem 4.5. 


Proof (Theorem 4.5) Due to the notational complexity we strive for a clear 
and logical organization. We strongly suggest to grasp the "big picture" first 
before getting lost in technical details. 

Denote by X^ '”^ = 


X 


(l:n) 

1,1 


yr{l:n) yr{l:n) 
■^1,2 ' "^2 


„(n:n) _ / ^{n-.n) ^{n:n) „ 

, -A — 1 ^ 2 / -^2 


{n:n) 


the reordered random vectors (4.6) obtained in step 3 of the MRA. More pre¬ 
cisely. 


X^ are given by 


/ yiV.n) y{l:n) Y(l:n)^ 

^^Aii ,A ^2 J 


■■=K = 


1y 1 lY^V2(V0,2(V0,\d‘?0,l(l)))) 

A]^, A 2 


/ Y(ti:n) y{n:n) y{n:n)\ _„„ 

^Aii ,Aj2 /A 2 J .— A0 



n ^ ‘?0,2 (" P0,2 (" P0,\ (" ‘70,1 (»?)))) ^ 
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4.2. Convergence results 


Our overall goal is to show that distribution. By the 

auxiliary result 4.2.1 we can then immediately conclude that the empir- 
ical distribution function F" of the i.i.d. random vectors X^' ,...,X 0 ' 
converges towards in the sense of (4.17). 


Step 1 We show that X^satisfy the conditional independence 

assumption (2.3). Since are i.i.d. it is enough to show that 

(2.3) holds for X^'"^ For this purpose we introduce the random variable 

= X^^^^ + X^^^\ Note that for X^ ”^ the conditional independence 
assumption then reads: 

(xg”^,x{)2”^'^r”') ^ + X^^^"^) I xf (4.21) 

The discussion in the Preliminaries 4.2.2 is crucial for the proof of (4.21). We 
argued that X^'"^ can be written as a function in X^^ ”^ by X^^ ”^ = f{X^’"^), 
where f{x) as a function in x is a random function consisting of random 
variables that are all independent of 

Let g : > K and li : > R be two measurable and bounded func¬ 

tions. 

= E[^(xg”),xg”Uf''V(/(xf”)),xj^^") + /(xf”))) I 
= E[g{x[f \x[f\x)h{f{xlx + f{x)) I Xf”)] 


Using that h{f{x),x + f{x)) is a function of random variables that are all 
independent of xj^^”^,xj 2 ”^ X^'"\ we proceed by 


'x=X- 


(l:n) 


= E[h{f{x),x+f{x))] £[^(x;;-”Ui;r^^) i 

= E[E(/(xf”)),xf") + /(Xf”))) I xf'^)]£[g(xg'^^xg’^),xr”^) I Xf”) 
= E[h{X^2 '-"\A^''"^ +^2 '"^) I Xf^”^]£[g(xj)-"),x[2”^'^r”') I 

This proves (4.21) and hence the first step. 


Step 2 In the second and last step we prove that the reordered random 
vector X^ '^^ = ^x(^^”^,xj 2 ”^^ 2 ^ converges in distribution towards the 
unique tree dependent random vector X 0 = (Xi i,Xi 2 , X 2 ). 
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4. Tree dependent sampling 


Recall that we know from Theorem 4.4 that approximates the dis¬ 

tributions Gj and Fj, J G More precisely, with Xi = Xi i -|- Xi 2 it holds 
that 


1 , 2 

\-^2 ’ '^\ 


^ (Xi,i,Xi,2,Xi), 

) (X 2 , Xi) in distribution. 


(4.22) 

(4.23) 


Let E = {ranXi 4 ,ranXi^ 2 /ranX 2 } C IR be the discrete set comprising the 
ranges of X 14 , Xi ^2 and X 2 . Note that |ranXj^^”\ranX| 2 ”^,ranX 2 ^ ”^| C E, 
for all n G N. Then 


Xi : (a^,P) ^ 

Xf'^) : (a<P) ^ {E,P{£)), 

where ViS) is the power set of E. Denote by ^ = P o X^^ and ji„ = 
P o the push-forward measures of P by Xi, respectively xj^ ”^ 

By (4.22), the sequence of measures }i„ on {E,V{£)) converges weakly to¬ 
wards the measure ji. 


Keeping this in mind, we are now ready to prove the main claim. Let 
{Xl,l,Xi,2,X2) G R^. 


P[Xi,i 
= e\ 


< ^l,l/Xi^2 < M,2/X2 < X2] — P[l{Xi,i<Xi,i}l{Xi,2<Xi,2}l{X2<X2}] 

P[l{Xy<Xi,i}l{Xi,2<Xi,2}l{X2<X2} I ^l] 


Using that X 0 satisfies the conditional independence assumption (2.3), we 
get 


= E 


P[l{Xy<Xy}l{Xi.2<Xi,2} I ] £ [11{X2<X2} I 
= / ^[^{Xii<xii}l{Xi 2 <xi 2 } I Xi = x]£[ 1 {X 2 <x 2 } I Xi = x]d}i{x) 

V-- 

=: h{x) 


Note that h{x) := £[l{Xi,i<xi,i}l{Xi,2<xi,2} I Xi = x]£[ll|X2<x2} | Xi = x] is a 
function in x mapping from £ to R. In fact, £:£—)■ R as such is continuous 
and bounded. So the Portmanteau-Theorem implies 


= lim 

m^co 


^£[l{Xy<xi,i}l{Xi,2<xi,2} I Xi = x]£[1{X2<x2} I Xi =x]d^„^{x) 




Leth„{x) P[l|xG")<xy}^|x<)'"><xi, 2 } ^ ^ “ ^l^[^{xf^")<x 2 } I 

The functions h„ : E ^ IR are continuous and bounded and the 4* auxiliary 
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4.2. Convergence results 


result 4.2.2 implies that lim h„(x) = h(x). Therefore, by dominated conver- 

n^oo 

gence 


= lim lim 

m^co n^co 




=: h„{x) 


= x] djim{x) 


We claim (proof below) that the iterated limes lim lim L hn(x)du^{x) equals 





= lim E 

n^co 


£|1, 




)rding to the first step 
the conditional independence assumption (4.21) and so 


X 


(l:n)n 


Note that according to the first step, j satisfies 


^[^{xg”><Xi,i}^{x<)2"><Xi,2}^{xf"><X2} I 

= lim P[x\\”^ < Xi,i,x{ 2 ”' < Xi, 2 ,xf < X 2 ]. 


(l:n)i 


This proves the second step and hence the whole Theorem 4.5. It only re¬ 
mains to show that the above claim holds true. Let us formulate the claim 
once again for the sake of clarity. 


Claim Let h and h„, n G N, be continuous and bounded functions (||l2||oo, 
ll^nlloo < 1) mapping from a discrete set E to IR and suppose that the se¬ 
quence h„ converges pointwise towards h. Let F and Fm, m G N, be distribu¬ 
tion functions of discrete random variables on £ and denote the associated 
measures on £ by /i, respectively fim- Suppose Fm converges uniformly to¬ 
wards F as in (4.11). Then the following equality holds: 

lim lim / h„(x)dum(x) = lim / h„(x)dum(x). 
m^oon^coJE {m,n)^ooJE 


Proof Let Fm,n ■= j^hn{x)d}im{^), £ := define Fm,oo '■ = 

lim £m n/ boo n := lim Fm n- Note that dominated converges and the Portmanteau- 

n^oo ' ' m^oo ' 

Theorem imply that £ = lim Fmca = lim Loon- 

m^co ' n^oo ' 
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4. Tree dependent sampling 


We need to show that for any £ > 0 there exists fC G N such that Vm, n > K: 

|L l^m,n I ^ £• 


Because lim Lean = I- we can find fCi G N such that Vn > Ki: \L — Lconl < f • 

Since F is the distribution function of a discrete random variable there ex¬ 
ists a finite set A C E, of cardinality |A| < oo, such that }i{A) > 1 — ^ 
where S is defined as := Together, the 1®* auxiliary result 4.2.1 

and the fact that A is finite, imply that there exists fC 2 G N such that 
Vm > K 2 : \}iix) — }imix)\ < ^, Vx G A. Hence, we can deduce that Vm > K 2 : 
}im{A^) < ^{A‘=) + \A\S <3+\A\3. 

Therefore, we can conclude that for all m > K 2 , n G N: 


1 Eoo,n Lfn^n \ — 

/ hn{x)d}l{x)- / hn{x)d}lm{x) 


Je Je 


< 


hn{x) d{li{x) - lim{x)) 
ll^njlco G 1 


hn{x^ d{ji{x) 


\\h„\\oo < 1 


< Ifl(x) - }im{x)\ + }l{A‘=) + }lm{A^) 

xeA 

< |A|^ + ^ + ^+|A|^ = ^(2|A|+2) = I 


}lm{x)) 


Set K = max {fCi, fC 2 } and note that then Vm, n > K: 


\L-L„ 


< lE-Lo 


^00 n ^ 


m,n I — 2 2 — ^ 


□ 


We have seen that under the assumption of discrete marginals the MRA 
yields approximations of the unique tree dependent random vector. 

The MRA is a modification of the original reordering algorithm proposed 
by Arbenz et al. [1]. We discussed the modifications in the beginning of 
Section 4.1. Due to the complexity and length of the above proof it might 
be difficult to identify the part of it where the modifications play a key role. 
We stress therefore that in particular the proof of the first step critically de¬ 
pends on them. The second step then exploits the fact that the marginals are 
discrete. 


It currently remains open to show that the results also hold for more gen¬ 
eral marginals. In addition, it would be desirable to obtain similar results 
for the original reordering algorithm. The MRA has the disadvantage that 
its algorithmic efficiency decreases significantly with the number of levels in 
the aggregation tree. This is due to the second modification, which requires 
the algorithm to call itself repeatedly on each level to generate independent 


36 
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samples and, thus, leads to an exponential running time (in the number of 
levels). The original reordering algorithm, in contrast, does not suffer from 
this issue and will, therefore, most likely be favoured for practical applica¬ 
tions. 

4.3 A word on the conditional independence assumption 

The conditional independence assumption (2.3) specifies the unique tree de¬ 
pendent distribution for a given aggregation tree model. One should defi¬ 
nitely wonder, however, if this assumption is reasonable in practice. In this 
section we will briefly address this question. 

Testing for conditional independence requires a large amount of data. Un¬ 
fortunately, meaningful historical data is extremely rare in practice, and so 
we consider the following logical reasoning instead: 

Assume the true risk distribution is given by (X, Y, Z) ~ A/”(f/, X) with 


! ° \ 

/ 1 

0.5 



F= 0 , 

X= 0.5 

1 

0 . 

(4.24) 

\ 0 / 

\ 0 

0 

1 / 



We first consider the aggregation tree model showed on the left of Figure 4.2 
and denote by (x^'^q Y^'^qZ^'^i) its tree dependent distribution. It is easy to 
show that for this model the tree dependent distribution is equivalent to the 
true risk distribution, i.e. (x‘‘^i, Z^'^i) ~A/'(fi,X). 



X Y Z X Z Y 


Figure 4.2: Different ways of designing the tree structure. 


Consider now the aggregation tree model on the right of Figure 4.2. Note 
that this model incorporates two bivariate normal copulas Cx,z and Cx+z,y 
describing the dependence structure between X and Z, respectively X -|- Z 
and y. By (4.24), the copula Cx,z must be the independent copula, and the 
copula Cx+z,y has correlation matrix 

( ^ 

Rx+z,y — I _\_ 

V 2V2 




4. Tree dependent sampling 


Let yf<i 2 denote the tree dependent distribution associated with 

this model. A simple calculation yields that ~ 

with 


/ 0 \ /I 0.25 0 \ 

0 , = 0.25 1 0.25 

\ 0 / \ 0 0.25 1 / 


Obviously, the distributions of and (x‘‘^ 2 ,yf‘^ 2 ^ZtA) ^rg not 

the same, and the example clearly shows that the tree dependent distribu¬ 
tion is not invariant to a change of the tree structure. 


In particular, this simple example illustrates that the conditional indepen¬ 
dence assumption should be treated with much caution: The tree dependent 
distribution critically depends on the tree structure. In practice, it will there¬ 
fore hardly ever be an adequate approximation of the reality, unless the tree 
has exactly be designed in such a way that the data satisfies the conditional 
independence assumption. Easier said than done, considering that models 
with hundreds of individual risks are not unusual and meaningful data is 
often rare. 
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Chapter 5 


The space of mildly tree dependent 

distributions 


As we have mentioned already in the first chapters, an aggregation tree 
model ( 2 . 2 ) does in general not uniquely specify the joint distribution of 
all risks. We called those distributions, which fit a given aggregation tree 
model, mildly tree dependent (recall Definition 2.6). 

Our lead Example 1.1 shall once more serve as motivation for this chap¬ 
ter. Assume that - similar as in the Concluding remark 3.3 - an insurance 
company is also interested in the total risk they are taking in Switzerland, 
i.e. the risk S = Xi i + X 24 . As we know by now, the aggregation tree 
model does not fully specify this distribution. Since for instance the co- 
variance Cov(Xi 4 ,X 24 ) can vary depending on which mildly tree depen¬ 
dent distribution we look at, the variance Var(S) = Var(Xi i) -|-Var(X 24 ) -|- 
2 Cov(Xi 4 , X 24 ) will vary as well. 


Lower bound Upper bound 

for Cov(Xj j) for CovfXj j,X, j) 


Cov(Xj],X,,) in the 
tree dependent case 


In the previous chapters we have seen that there is high evidence that the 
sample reordering algorithm yields approximations of the unique tree de¬ 
pendent distribution. However, as illustrated in Section 4.3, it is question¬ 
able if the conditional independence assumption (2.3), which specifies the 
tree dependent distribution, is indeed a reasonable assumption in reality. 

Therefore, it makes definitely sense to think about, for instance, how the 


39 





5. The space of mildly tree dependent distributions 

range of values that Cov(Xi i, X 24 ) can take looks like. How large is it? 
Where does the tree dependent distribution lie in it? How do the different 
input parameters influence this range? 

In this chapter we will address these kind of questions. We will study sim¬ 
ple trees for different sizes and shapes and we determine the space of mildly 
tree dependent distribution in various scenarios. These "toy-models" help 
us to understand the basic phenomena's at play while aggregating risks in 
this way. We believe that any insurer or reinsurer using such a tool should 
be aware of these systematic effects. 

5.1 The three-dimensional Gaussian tree 

In this first section we study extensively the three-dimensional Gaussian tree 
for which analytical results can be derived. The notation from the previous 
chapters is rather inconvenient for our purpose, and so we believe that a 
change of notation is justified. Consider the following model setup. 

Defining the model 

Our three-dimensional Gaussian tree incorporates three normally distributed 
random variables 

Xi ~ X 2 ~ A/”(0,(7|), X 3 ~ A/”(0,(7|), 

which represent the marginal risks. Note that w.l.o.g. we can assume that 
their means equal zero. Suppose further that we are given two bivariate 
normal copulas C 12 and C 0 with correlation matrices 

= ( pi ?) = ( p. 7 ) ■ 

Here C 12 is the dependence structure that we impose between the risks Xi 
and X 2 , whereas C 0 is the dependence structure between the aggregated 
risk Xi -|- X 2 and X 3 . The so defined aggregation tree model is illustrated in 
Figure 5.1. 

Note that with the above information the joint distribution of the random 
vector (Xi,X 2 , X 3 ) is not uniquely specified. If we further assume that 
the joint distribution is multivariate normal, it follows immediately that 
(Xi,X 2 ,X 3 ) ~A/'(f/,X) with 

/ 0 \ / Pl2‘^l‘^2 0-13 \ 

/I = 0 , i: = pl2(^lO-2 (7-| (723 , 

\ 0 / V C731 (732 (7-| / 
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5.1. The three-dimensional Gaussian tree 



Figure 5.1: Illustration of our aggregation tree model. 


where the covariances ni 3 = and (723 = 032 enjoy a certain degree of 
freedom. We are, however, not completely free at choosing ni 3 and ( 723 . A 

simple calculation yields that (733 -|- (723 = P 0 ^crl + cr^ + 2 pi 2 ( 7 in 2 must hold 
in order to guarantee that the dependence structure between Xi -|- X 2 and X 3 
is indeed given by C 0 . A further constraint is that the resulting covariance 
matrix E must be positive semidefmite. 


Determining the space of mildly tree dependent distributions 

Consider the above model where the parameters ( 7 i, ^ 2 , < 73 , pi 2 and P 0 are 
given. Any multivariate normal distribution (Xi,X 2 , X 3 ) ~ J\f{p,E) which 
satisfies the above mentioned constraints 

<^13 + <^23 = P 0 \l-\- Cr 2 + 2pi2(7i(72, (5.1) 

E pos. semidefmite, (5.2) 


is a mildly tree dependent distribution of our aggregation tree model. Our 
aim is to derive an analytic expression for the values the correlation coeffi¬ 
cient pi 3 := corr(Xi,X 3 ) = ^ can take. 

It turns out that the challenging part is to guarantee that the second con¬ 
straint (5.2) is satisfied. We exploit the fact that every positive semidefmite 
matrix E has a Cholesky decomposition, i.e. there exists a lower triangular 
matrix L such that LLJ = E: 


/hi 0 0 \ / ^11 hi hi \ 

LL^ = I hi h2 0 1 j 0 £22 £32 1 

\ ^31 h2 h3 / \ 0 0 f'33 / 


/ 

£ii£21 

£ii£31 \ 

\ '■ ( 

P12(7i(72 

03 \ 

hihi 

£\l -|- ^22 

hihi + h2£32 

= I Pl2<^l<^2 


03 

\ £ii£31 

f^2lf^31 + ^22^32 

%1 + ^32 + ^33 ) 

\ 0-31 

<^32 

<^1 ] 


X. 
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In this sense the problem translates into solving the following system of 
equations for the variables £n, ^ 21 , ^ 22 , ^ 31 , ^ 32 , £ 33 ' 

Lli — (7j 

^11^21 = Pl20'l(^2 

4 l + ^22 = (^2 

^31 + ^32 + ^33 = ^3 ( 5 - 3 ) 

^11^31 + ^21^31 + ^22^32 = Pe}\J O'l + + 2p\20'l0'2 (5.4) 

Note that the last equation (5.4) is the constraint (5.1). The first three equa¬ 
tions can be solved easily and we obtain In = iui, £21 = ±| 0 i 2 C ^2 and 

£22 = “ P 12 )’ where w.l.o.g. it is enough to consider the positive 

solutions only. 

Next, we solve equation (5.4) for £ 32 : 

P ^\ + C ^2 5 “ 2 pi 2 C^l ‘^2 — ^31(^11 + ^21) 

£32 = —^-=: M - v£^i, (5.5) 

1-22 

where u := ^ In+I 21 _ yyg plug ( 5 . 5 ) into equation 

(5.3): 


£l^ + {u-v£^if + £l^ = al, 


and after completing the square and a few simple algebraic operations we 
obtain the following equivalent equation: 


(4i - 


^33 




= 1 . 


1+172 


(5.6) 


Let (xo,yo) := (j^^O), a := + b := yjcr^ - ^ and 

note that equation (5.6) can then be written as 


(£31 — ^ 0 )^ {£33 — yo)^_-. 

^ b^ ~ ' 


(5.7) 


which is the implicit equation of an ellipse with centre coordinates (xq/I/o) 
and axes a and b. 


We can conclude that £31 can take any value in the interval [xq — a,Xo + a], 


and hence pi 3 := 


— +i3_ _ hTsi 


can take any value in 


ai{xo-a) cri{xo+a) 

CTitT-g ' (JjtTg 
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Comprehensive algebraic simplifications of the two endpoints 
and := yield that in their simplest form they read as 


^mtn _ 

P13 — 


^max _ 

P13 — 


P0Pl2O'2+ P 0 O-I \j^2^ Pl2)(l P 0 ) 


-h 2^12^1 (72 V CTf + CT + 210120^1 C"2 


P0Pi 2<72 + P0ni ^ Pl2)(l PV) 


Crf + Cr^+ 2^12^1 (72 A/ + ^2 + 2|0l2C^lC^2 


_ .^mid 

— 'P\3 


_ length 

~-Pl 3 


(71173 


( 5 . 8 ) 

( 5 . 9 ) 


In summary, we can say that for our three-dimensional Gaussian tree (Figure 
5 . 1 ) the correlation coefficient pis := corr(Xi, X3) of a mildly tree dependent 
distribution can take any value in ,p^i^] ■ The midpoint of this interval 
is p™'^ and the length of the interval is given by . 


Note that once we know the values pis (resp. dis) can take, it is easy to 
obtain analogous expressions for the values P23 (resp. (723) can take, simply 
by exploiting the relation ( 5 . 1 ). In this way we get a parametrization of all 
multivariate normal, mildly tree dependent distributions. 


In the following examples we will further study and illustrate the expres¬ 
sions we have just derived for pf^" and pfs”^. 


Example 1 

In this first example we study the influence of the variances cr^, cr^ and (Tj 
on the range [Pi3”/Pi3^^] of possible correlations between Xi and X3. 


Case 1: cr^ 00 We let the variance of Xi go to infinity. It is to expect that 
since Xi will dominate X2, the influence of X2 on the dependence structure 
between Xi -|- X2 and X3 will be negligible, i.e. corr(Xi,X3) corr(Xi -|- 
X-i, X3) = p0. It is easy to check that 

lim p--= lirn pr3“^ = P0, 

(7?^0O (71^00 


which confirms our intuition. Note that since the correlation coefficient as¬ 
sociated with the unique tree dependent distribution (let us denote it by pjg) 


must lie somewhere inside the degenerated interval 
P0, we can deduce that pjj must be equal to p0. 
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Case 2: —>■ oo We let the variance of X2 go to infinity. In contrast to 

the previous case, the influence of Xi on the dependence structure between 
Xi + X2 and X3 can be seen as negligible. Intuitively, we can expect that in 
this case the interval [p™”, p’13^] attains its maximum length. Our intuition 
is again confirmed by the analytical results: 

lim = P0P12 - 7(1 - p?2)(l - pI) 

lim p^^^ = P0P12 + 7(1 - P?2)(l - pD- 

Note that for fixed pi2, P0 the maximum of is indeed y ^(1 — pl2){^ ~ P%)- 

Case 3: erf 00 We let the variance of X3 go to infinity From the expres¬ 
sions ( 5 . 8 ) & ( 5 . 9 ) we see immediately that p^f" and p^^^ do not depend on 
0-3- 

To summarise, it can be noted that the range of values that p^^^ can at¬ 
tain becomes larger if the variance of Xi is small compared to the one of X2. 

If the opposite is the case, then becomes smaller. 

Example 2 

Let for the second example Ui = (72 = ^3 = 1 be fixed and equal to one. We 
plot the difference p^^^ — p'^f" = between the upper and the lower 

bounds for pi3 in terms of pi2 and p0. 

The result is displayed in Figure 5 . 2 . It seems that the space of mildly tree 
dependent distribution increases the more Xi -|- X2 and X3 become uncorre¬ 
lated/independent (p0 —> 0). In the same time it also increases the more Xi 
and X2 become negatively correlated (pi2 —> — 1 ). The maximum is attained 
in (P12/P0) = (“1/0) where it holds that p^-^^ — p^f" = 2, i.e. P13 can take 
any value in [P]3”/Pi3“^] = [“1/1]- We will further study this effect in the 
next example. 

Example 3 

We study the same scenario as in the previous example (ui = (72 = (73 = 1 ). 
This time, we plot the interval [p^3”,p^3^] against p0 for nine different val¬ 
ues of P12. The resulting plots are displayed in Figure 5 . 3 . Note that we 
choose to display also the values of the tree dependent correlation coeffi¬ 
cient p^g. 

It is not surprising that we can observe the same effect as in the previous 
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5.1. The three-dimensional Gaussian tree 


2 ^ 
1.5 ^ 



Figure 5.2: Difference between the upper and the lower bounds for | 0 i 3 in Example 2. 


example. In addition, the illustration also reveals how position and length 
of the interval change in terms of pi2 and p^. 

In case of total correlation between Xi and X2 (pi2 = 1 ), the interval [^^3”, pf^^] 
becomes degenerated and coincides with pjg of the unique tree dependent 
distribution. In fact, [Pi^^'Pn^] seems to be symmetric around i.e. pjg 
is equal to the midpoint p^j‘^ of the interval. A simple calculation confirms 
our assumption. 

We stress that this effect can not be observed in general. For a four-dimensional 
Gaussian tree the equality would, for instance, not hold anymore, 

and so is closer either to the left or to the right endpoint of the interval 

[^min ^maxl 
[Pl 3 J- 


Example 4 

The previous examples have shown that the space of mildly tree depen¬ 
dent distributions may be particularly large under some circumstances. In 
practice, we would probably prefer a model where the space of mildly tree 
dependent distributions is small. Because then the mildly tree dependent 
distributions are concentrated around the true joint risk distribution, and so 
each of them yields already a good approximation of said true risk distribu¬ 
tion. 


In this example we show how the choice of an appropriate aggregation 
tree model can considerably narrow the space of mildly tree dependent 
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-1 -0.5 0 0.5 1 

possible values of p.|3 


Figure 5.3: Illustration of the interval in terms of |Oi 2 and p 0 in Example 3. 


distributions. Let for this purpose the true risk distribution be given by 
(Xi,X 2 ,X 3 ) with 



Assume two insurers (Insurer 1 & Insurer 2) wish to fit a three-dimensional 
Gaussian aggregation tree model. Their approach is, however, not exactly 
the same, as illustrated in Figure 5.4. 


Insurer 1 



Insurer 2 



Figure 5.4: Illustration of the aggregation tree models of Insurer 1 and Insurer 2. 
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5.1. The three-dimensional Gaussian tree 


Insurer 1 Insurer 1 decides to fit an aggregation tree as illustrated on the 
left of Figure 5.4. Hence he imposes a normal copula C 12 between Xi and 
X2 and another normal copula C 0 between Xi -|- X2 and X3. 

From the information available (data & expert opinion) he obtains the fol¬ 
lowing exact estimation of the marginal risks: Xi ~ X 2 ~ 

and X3 ~ A/”(0,1). He further estimates that the normal copulas C 12 and C 0 
have correlation matrices 

R,2 = p j and R0 = ( (, 1 ) . 

With the formulas (5.8) & (5.9) one can easily check that for this aggrega¬ 
tion tree model the space of normal, mildly tree dependent distributions 
becomes maximal. A parametrization of this space would for instance be 
given by Xa ~ Xa) with 


Fa 



a G [—1,1]. 


Insurer 2 Insurer 2 decides to fit an aggregation tree as illustrated on the 
right of Figure 5.4. Hence, he imposes a normal copula C 13 between Xi and 
X3 and another normal copula C 0 between Xi -|- X3 and X2. 

From the information available (data & expert opinion), he obtains the fol¬ 
lowing exact estimation of the marginal risks: Xi ~ X 2 ~ A/^(0,2) 

and X3 ~ A/^(0,1). He further estimates that the normal copulas C13 and C0 
have correlation matrices 

Ki 3 = ^ ^ and R 0 = 

In this case, the aggregation tree model fully specifies the joint distribu¬ 
tion and the space of mildly tree dependent distributions is minimal. The 
only distribution contained in it is the true risk distribution (Xi,X2, X3) ~ 

Note that both approaches are correct and would yield the same distribu¬ 
tion for the aggregated risk Xi -|- X2 -|- X3. The difference is that, although 
both insures need to estimate the same number of parameters, the model 
of "Insurer 2" fully specifies the joint distribution, whereas the model of 
"Insurer 1" does not. 



In summary, we have shown in detail how the space of mildly tree depen¬ 
dent distribution is affected by the different input parameters of the aggre¬ 
gation tree model. The results of this section suggest that the aggregation 
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5. The space of mildly tree dependent distributions 

tree should, whenever possible, be constructed in such a way that the risks 
in the sub-portfolios do have a strong positive correlation. By doing so, the 
space of mildly tree dependent can be considerably narrowed, as Example 4 
has impressively illustrated. 

The idea of grouping together highly correlated risks in sub-portfolios when 
setting up the aggregation tree is not new and has already been proposed by 
Cote & Genest [4] and Bruneton [2]. Their arguments are, however, rather 
of qualitative and practical nature. To our knowledge, the insights gained 
in this section are for the first time providing also a profound mathematical 
justification for this approach. 

5.2 Generalization to non-Gaussian trees 

In this short section we discuss the space of mildly tree dependent distribu¬ 
tions in more general settings. Subject of the first subsection is again the 
three-dimensional tree. In contrast to the previous section we do, however, 
not limit ourselves to Gaussian trees only. In the second subsection we study 
some properties of an eight-dimensional, symmetric aggregation tree. 

5.2.1 The general three-dimensional tree 

Gonsider the same aggregation tree structure as in Figure 5.1. Regarding the 
distributions of Xi, X 2 and X 3 the only assumption is that af = Var(X,) < 00 , 
for i G {1,2,3}. We further assume that the copulas C 12 and C 0 are such 
that corr(Xi, X 2 ) = pi 2 and corr(Xi -|- X 2 , X 3 ) = p 0 . 

Suppose we want to determine the set of covariance matrices 

/ aj Pl2<^lC2 \ 

= j Pl 2 <^l <^2 <^2 C23 I / 

\ C3I C32 C ^3 / 

which are compliant with our model (in the sense that there exists a mildly 
tree dependent distribution with covariance matrix X). Obviously, this is 
exactly the same problem as in the previous section, and hence it all comes 
down to determining those X for which the constraints (5.1) and (5.2) are 
satisfied. 

We can proceed as in the previous section to obtain the same expressions 
(5.8) and (5.9) for the lower and upper bound for pi 3 := corr(Xi,X 3 ). Once 
we have this, it is straightforward to obtain a parametrization for the X's. 

We would like to emphasize that in the previous section, in fact, we never 
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5.2. Generalization to non-Gaussian trees 


used the Normality of the tree. In particular, all the results and examples 
from the previous section are still true and remain the same, irrespective of 
whether the tree is Gaussian or not. 

5.2.2 An eight-dimensional, symmetric tree 

In this subsection we consider a perfectly symmetric tree involving eight 
identically distributed risks with mean 0 and variance 1 , as illustrated in 
Figure 5.5. Note that this model incorporates seven bivariate copulas. We 
assume that each of these seven copulas is such that the correlation between 
the two risks linked through the respective copula is always equal to p G 
[—1,1], for instance corr(X 3 , X 4 ) = p, as well as corr(S 5 , Se) = p. 



The tree dependent distribution 

This special setup allows us to derive an expression for the correlation be¬ 
tween two individual risks in case the conditional independence assumption 
(2.3) is satisfied (also see [4], Example 2). 

Assume (2.3) holds and (Xi,...,X8) is the unique tree dependent distri¬ 
bution. We want to calculate •= corr(Xi,X 3 ). By (2.3) and the tower 
property, 

corr(Xi,X3) = E[XiX3] = £[£[XiX3 | Si,S 2 ]] 

= £[£[Xi I Si,S 2]£[X3 I Si,S2]] 

= £[£[Xi I Si]£[X3 I S 2 ]]. 

Since £[Xi | Si]+£[X 2 | Si] = £[Xi + X 2 | Si] = Xi + X 2 = Si and 
£[Xi I Si] = £[X 2 I Si] by symmetry, we can deduce that £[Xi | Si] = 
Analogously, it holds that £[X 3 | S 2 ] = ^. Hence, 

corr(Xi,X 3 ) = £[yy] = |£[SiS 2 ] = |pVar(Si)Var(S 2 ) = 

Analogously, we get that p^g := corr(Xi,X8) = . In fact, for a 

perfectly symmetric tree involving 2^ identically distributed risks a simple 
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5. The space of mildly tree dependent distributions 


induction argument yields that the correlation between parents of degree 

k G {l,.. .,2^} is —. For p G (—1,1) this directly implies that in the 
limit, the correlation between parents of degree k tends to 0 as /c —> oo. 

The mildly tree dependent distributions 

Let now (Xi,..., Xs) be a mildly tree dependent distribution of our aggre¬ 
gation tree model. Our goal is to determine the lower and upper bounds, 
and for the correlation corr(Xi,X 3 ), as well as the bounds, 
and Pig^, for corr(Xi, Xg). 

The problem reduces to determine the set of covariance matrices satisfying 
a couple of constraints similar to those in (5.1) and (5.2). Again, the main 
difficulty is to guarantee that the covariance matrix of (Xi,.. .,X 8 ) is posi¬ 
tive semidefinite. Unlike the three-dimensional case, analytical results can 
not be derived so easily. 

Fortunately, the problem can be easily formulated as a semidefinite pro¬ 
gramming problem for which sufficiently exact approximations exist. For 
a good introduction to semidefinite programming see [5]. We use the free 
MATLAB software SDPT3 from [9] to solve the semidefinite programming 
problem. 



possible values of p,3 possible values of p,^ 

Figure 5.6: Illustration of the results for the eight-dimensional, symmetric tree. 


In Figure 5.6 we display the results. The light blue plot on the left shows 
the interval [Pi 3 ”,Pi 3 “^], as well as the value of in terms of p G [-1,1]. 
For p < 0, the correlation pi 3 of a mildly tree dependent distributions could 
take any value in [—1,1]. For p > 0 , the length of [p^ 3 ”, p’™^] is decreasing 
in p. 

A similar picture is found in the middle plot where we display the interval 
[Pii"'Pi8^]' value of p(g, in terms of p G [—1,Ij. 

In the right plot we finally display the two previous plots together. It is re- 
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5.2. Generalization to non-Gaussian trees 


markable that the left plot is completely included in the middle plot, i.e. for 
a fixed value of p, corr(Xi,X8) does enjoys more freedom than corr(Xi,X 3 ). 
The result is not surprising: It should be intuitively clear that the depen¬ 
dence structure becomes less and less specified the further away two risks 
are laying from each other in the aggregation tree. Figure 5.6 just confirms 
our assumption. 
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Chapter 6 


Conclusion 


The present thesis provides a self-contained discussion of copula-based hier¬ 
archical risk aggregation. A strong focus is placed on the joint distributions 
associated with an aggregation tree model. 

Chapter 2 and 3 introduced the aggregation tree model as well as the sample 
reordering algorithm developed by Arbenz et al. [1]. The aggregation tree 
model, consisting of a tree structure, copulas, and marginal distributions, 
offers a very flexible way to model risks. The method does not require to 
explicitly specify the dependence structure between all risks, but requires to 
do so only for different (sub-)aggregates and risk classes. As a consequence, 
it does not lead to a unique joint distribution of all risk. This fact is not 
critical as long as the ultimate goal of the user is to develop a model for the 
total aggregate. 

In practice, however, some applications require the joint distribution of the 
individual risk. Against this background, we focused on two main ques¬ 
tions. 

The subject of the first question was the sample reordering algorithm. The 
original algorithm proposed by Arbenz et al. yields samples that approxi¬ 
mate the total aggregate; yet hardly anything is known about the joint distri¬ 
bution approximated by the samples. We have shown that there is numeri¬ 
cal evidence that the samples may approximate the unique tree dependent 
distribution. This has encouraged us to develop a modified reordering al¬ 
gorithm (MRA). Under the additional constraint of discrete marginals the 
MRA yields an approximation of the unique tree dependent distribution. 
The proof mainly exploits the fact that the reordered samples satisfy the 
conditional independence assumption, which specifies the unique tree de¬ 
pendent distribution. It must be said, though, that the practical relevance of 
the MRA is questionable due to its poor algorithmic efficiency when the tree 
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structure contains many aggregation levels. The question whether similar 
results can be proved for the original algorithm remains an interesting topic 
for future research. 

It is currently not sure whether the conditional independence assumption 
is indeed a reasonable assumption in practice. The logical consideration 
of this question in Section 4.3 suggest that the assumption should at least 
be treated with much caution. In this respect, it is important to have a 
better understanding of the space of mildly tree dependent distributions. 
This was the subject of the second question. On the simple example of a 
three-dimensional Gaussian aggregation tree model we illustrated how the 
different parameters of the aggregation tree model affect the space of mildly 
tree dependent distributions. In particular, we showed that the space can 
be considerably narrowed by choosing a favourable aggregation order. The 
systematic effects observed in this and more general examples finally also 
allowed conclusions of the way the aggregation tree should be designed. 
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Appendix 


In Section 4.2.2 we proved that when the MRA is applied to a 3-dimensional 
aggregation tree with discrete marginals, it yields samples that approximate 
the unique tree dependent distribution. 

In this appendix we briefly discuss how the result can be generalized for an 
arbitrary aggregation tree. We stress that what follows is indeed a discus¬ 
sion rather than a complete and rigorous proof. 

Let be an arbitrary aggregation tree model with 

discrete marginals. We denote by X 0 := its unique tree depen¬ 
dent distribution and for I E ^,Xi := denotes the restriction of 

X 0 on the components We want to show that the MRA-reordered 

random vectors converge in distribution towards the unique tree dependent 
distribution associated the model. The idea is to use induction from the bot¬ 
tom to the top of the tree. 

Fix hence I E SS and denote by Xj ,X\ ’ ^ the reordered random vec¬ 

tors (4.6) obtained in step 3 of the MRA for the branching node J. The 
induction hypothesis implies that Xj,- in distribution, for k G 

,n} and z = 1 ,..., Nj. 

Recall that we divided the proof of Theorem 4.5 into two steps. The first step 
was to show that the X^ ”^,... ,x[” ”^ satisfy the conditional independence 
assumption. We will assume for the moment that this is true. The second 
step can be easily generalized for an arbitrary aggregation tree model. The 
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crucial point is to note that 


P[Xj <xj:]e = E [£[1{X;<X;}: / e ^^{I) I X;,!,.. 


= E 


N[ 

I : / G I X;,^] 


due to the conditional independence assumption and the tower-property. 
Then we can proceed as in the second step and use the induction hypothesis 
to obtain the desired result. 


The extremely tedious part is to show that indeed satisfy 

the conditional independence assumption. Recall that this was already nota- 
tionally involved for the simple 3-dimensional tree. Although we are more 
than confident that the result can be generalized, it was not the subject of 
this thesis to provide a rigorous proof thereof. 
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